Scaffold Topologies. 2. Analysis of Chemical Databases

نویسندگان

  • Michael J. Wester
  • Sara N. Pollock
  • Evangelos A. Coutsias
  • Tharun Kumar Allu
  • Sorel Muresan
  • Tudor I. Oprea
چکیده

We have systematically enumerated graph representations of scaffold topologies for up to eight-ring molecules and four-valence atoms, thus providing coverage of the lower portion of the chemical space of small molecules (Pollock et al. J. Chem. Inf. Model., this issue). Here, we examine scaffold topology distributions for several databases: ChemNavigator and PubChem for commercially available chemicals, the Dictionary of Natural Products, a set of 2742 launched drugs, WOMBAT, a database of medicinal chemistry compounds, and two subsets of PubChem, "actives" and DSSTox comprising toxic substances. We also examined a virtual database of exhaustively enumerated small organic molecules, GDB (Fink et al. Angew. Chem., Int. Ed. 2005, 44, 1504-1508), and we contrast the scaffold topology distribution from these collections to the complete coverage of up to eight-ring molecules. For reasons related, perhaps, to synthetic accessibility and complexity, scaffolds exhibiting six rings or more are poorly represented. Among all collections examined, PubChem has the greatest scaffold topological diversity, whereas GDB is the most limited. More than 50% of all entries (13 000 000+ actual and 13 000 000+ virtual compounds) exhibit only eight distinct topologies, one of which is the nonscaffold topology that represents all treelike structures. However, most of the topologies are represented by a single or very small number of examples. Within topologies, we found that three-way scaffold connections (3-nodes) are much more frequent compared to four-way (4-node) connections. Fused rings have a slightly higher frequency in biologically oriented databases. Scaffold topologies can be the first step toward an efficient coarse-grained classification scheme of the molecules found in chemical databases.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Scaffold Topologies. 1. Exhaustive Enumeration up to Eight Rings

Mapping the chemical space of small organic molecules is approached from a theoretical graph theory viewpoint, in an effort to begin the systematic exploration of molecular topologies. We present an algorithm for exhaustive generation of scaffold topologies with up to eight rings and an efficient comparison method for graphs within this class. This method uses the return index, a topological in...

متن کامل

The Molecule Cloud - compact visualization of large collections of molecules

BACKGROUND Analysis and visualization of large collections of molecules is one of the most frequent challenges cheminformatics experts in pharmaceutical industry are facing. Various sophisticated methods are available to perform this task, including clustering, dimensionality reduction or scaffold frequency analysis. In any case, however, viewing and analyzing large tables with molecular struct...

متن کامل

Ligand based lead generation - considering chemical accessibility in rescaffolding approaches via BROOD

In pharmaceutical industry ligand based approaches like scaffold hopping, scaffold decoration and me-too approaches, are used to generate lead structures in discovery projects. We use several tools to generate novel lead structures, such as BROOD [1]. BROOD is a software tool which explores chemical space around query molecules based on shape similarity and electrostatics, and it generates anal...

متن کامل

The controlled release of dexamethasone sodium phosphate from bioactive electrospun PCL/gelatin nanofiber scaffold

In this study, a system of dexamethasone sodium phosphate (DEXP)-loaded chitosan nanoparticles embedded in poly-ε-caprolacton (PCL) and gelatin electrospun nanofiber scaffold was introduced with potential therapeutic application for treatment of the nervous system. Besides anti-inflammatory properties, DEXP act through its glucocorticoid receptors, which are involved in the inhibition of astroc...

متن کامل

Scaffold Hunter: Facilitating Drug Discovery by Visual Analysis of Chemical Space

The search for a new drug to cure a particular disease involves to find a chemical compound that influences a corresponding biological process, e.g., by inhibiting or activating an involved biological target molecule. A potential drug candidate however does not only need to show a sufficient amount of biological activity, but also needs to adhere to additional rules that define the basic limits...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of chemical information and modeling

دوره 48 7  شماره 

صفحات  -

تاریخ انتشار 2008